System and method for performing application level persistence

ABSTRACT

A method and apparatus for inserting and examining Cookies in the data streams of HTTP connections for the purpose of persistently directing HTTP connections to the same destination. A network device directs subsequent HTTP connections from the same client to the same server (destination) for accessing the requested resources. There are four modes for employing the Cookie to persistently direct HTTP connections. The associated mode inserts a Cookie that uniquely identifies the client into an HTTP response. The passive mode inserts Cookie information that uniquely identifies a previously selected destination into an HTTP response. In the rewrite mode, a network device manages the destination information that is rewritten over blank Cookie information generated by the destination producing the HTTP response. The insert mode inserts and removes Cookie information in the data packets for HTTP requests and response prior to processing by the destination.

RELATED APPLICATIONS

This utility patent application is a Continuation of U.S. patent application Ser. No. 11/260,651, filed on Oct. 26, 2005 and issued as U.S. Pat. No. 7,346,695, which is a Continuation of U.S. patent application Ser. No. 11/235,643, filed on Sep. 26, 2005 and issued as U.S. Pat. No. 7,287,084, and a Divisional of U.S. patent application Ser. No. 10/284,035, filed on Oct. 28, 2002 and issued as U.S. Pat. No. 6,970,933, which is a Continuation of U.S. patent application Ser. No. 10/006,555, filed on Dec. 4, 2001 and issued as U.S. Pat. No. 6,473,802, which is a Continuation of U.S. patent application Ser. No. 09/353,335, filed on Jul. 15, 1999 and issued as U.S. Pat. No. 6,374,300, the benefits of which are claimed under 35 U.S.C. §120, and are further incorporated herein by reference.

FIELD OF THE INVENTION

This application relates generally to distributing the load demand between servers on a network, and, more specifically, to employing an HTTP cookie to balance the load demand between servers on a wide area network of geographically distributed servers such as the Internet.

BACKGROUND OF THE INVENTION

Generally, it has proven difficult to reliably and efficiently load balance the demand for access to resources, e.g., a web-based application, email and streamed multimedia data, on a wide area network (WAN). One prior art attempt employed a look up table for storing a relationship mapping between a client's IP address and the IP address of the actual server that provided access to the resources for a domain name/IP address request. This table was usually held in the memory of a server array controller that managed several node servers that could provide access to the resources associated with the client's request. Typically, the server array controller would employ a load balancing technique to select and map the IP address of one of the managed node servers to the client's actual IP address and store this mapped relationship with a time stamp in the table. In this way, when a client repeated a request before the expiration of the time stamp, the controller would use the mapping stored in the table to automatically connect the client to the previously selected (load balanced) node server.

Additionally, if the time stamp had expired, the server array controller would again perform the load balancing technique to select one of the managed node servers to provide the actual access to the resources associated with the request. Each time the load balancing technique was performed, the controller would update the table to include a new time stamp and a new mapping of the client's unique IP address to the currently selected node server's IP address.

For a relatively small number of client requests, the above described prior art solution could reduce the demand on server array controller resources because the controller did not always have to perform a load balancing technique for each client request that occurred before the expiration of the time stamp. Instead, the controller only performed the load balancing technique for a new client request when the time stamp for a previous client request was expired. However, since all of the table entries had to be kept in the memory of the server array controller to be used effectively, the available controller resources for load balancing and managing several node servers decreased in proportion to an increase in the number of client requests. To ensure that table entries were not lost when the server array controller lost power or was rebooted, a copy of the table would be stored on a secondary storage medium. Also, under heavy load conditions, the secondary storage medium was often not fast enough to store the copy of table entries before the server array controller shut down.

Another significant problem with the prior art approach was that the client's IP address was not always unique. Although some clients might have their own unique IP address, many others used random virtual client IP addresses provided by a large Internet Service Provider (ISP), e.g., the America On-Line Corporation, to connect to the Internet. Since only a portion of a large ISP's clients are typically connected at any one time, a large ISP usually employs a proxy cache to randomly assign a relatively small number of virtual client IP addresses to the currently “on-line” (customers) clients. Typically, a proxy cache will assign one of the virtual client IP addresses to a client on a first available basis each time the client connects to the ISP and starts a session on the Internet. From the discussion above, it is apparent that when a client used a large ISP to connect to a WAN such as the Internet, the prior art did not provide an effective method for persistently mapping a client's relationship to the server that was selected to provide access to resources associated with a request.

Therefore, it is desirable to provide a method and system for automatically providing a persistent mapping of a previously selected destination for a domain name/IP address request. Preferably, the present invention employs a Cookie in a Hyper Text Transport Protocol (HTTP) data stream to identify a relationship between a previously selected destination and a client's HTTP request. The present invention overcomes many of the limitations of the prior art caused by the direct mapping of an actual destination IP address to a client's IP address.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing aspects and many of the attendant advantages of this invention will become more readily appreciated as the same become better understood by reference to the following detailed description, when taken in conjunction with the accompanying drawings, wherein:

FIG. 1A is a schematic overview of a system for processing an HTTP request that does not include a Cookie;

FIG. 1B is a schematic overview of a system for processing an HTTP request that does include a Cookie;

FIG. 2A is a flow chart showing an overview for processing an HTTP request that does not include a Cookie;

FIG. 2B is a flow chart illustrating an overview for processing an HTTP request that does include a Cookie;

FIG. 3A is a flow chart showing an associative mode for processing an HTTP request that does not include a Cookie;

FIG. 3B is a flow chart illustrating an associative mode for processing an HTTP request that does include a Cookie;

FIG. 4A is a flow chart showing a passive mode for processing an HTTP request that does not include a Cookie;

FIG. 4B is a flow chart illustrating a passive mode for processing an HTTP request that does include a Cookie;

FIG. 5A is a flow chart showing a rewrite mode for processing an HTTP request that does not include a Cookie;

FIG. 5B is a flow chart illustrating a rewrite mode for processing an HTTP request that does include a Cookie;

FIG. 6A is a flow chart showing an insert mode for processing an HTTP request that does not include a Cookie;

FIG. 6B is a flow chart illustrating an insert mode for processing an HTTP request that does include a Cookie;

FIGS. 7A-7C show exemplary code fragments of HTTP requests that include Cookies;

FIGS. 7D and 7E illustrate exemplary code fragments of HTTP responses that include information for setting Cookies at the senders of the associated HTTP requests;

FIG. 8 shows the buffering of communication between a client and a node server by a proxy server; and

FIG. 9 illustrates an exemplary computer system for the client.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

The present invention is directed to inserting and examining HTTP Cookies in the data streams of HTTP connections for the purpose of persistently directing HTTP connections to the same destination. The present invention enables a network transmission device, e.g., a router, to reliably and conveniently direct subsequent HTTP connections from the same client to the same server for accessing requested resources.

HTTP is an application level protocol for transferring resources across the Internet, e.g., a network data object or server, and it is specified by the URL. The Hyper Text Mark-up Language (HTML) is a simple data format that is used to create hypertext documents that are supported by the HTTP protocol. Together, these standards have contributed to create the World Wide Web (WWW) on the Internet. The WWW is a globally accessible and platform-independent hypermedia information system that has become a central access point to applications and services for people around the world.

A Cookie is a general mechanism, i.e., protocol, which server side connections can use to both store and retrieve information on the client side of the connection. The addition of a simple, persistent, client-side state significantly extends the capabilities of Internet-based client/server application programs. A server, when returning an HTTP object to a client, may also send a piece of state information which the client may store. Included in that state object is a description of the range of Uniform Resource Locators (URLs) for which the returned state is valid. Any future HTTP requests made by the client which fall in that range will include a transmittal of the current values of the state object from the client back to the sender. This state object is called a “Cookie,” for no compelling reason.

The Cookie mechanism provides a powerful tool that enables different types of application programs to be written for Internet-based environments. For example, a service program could use a Cookie to send back registration information and free the client from retyping a user identification number for each connection to the service. Also, an Internet site could store user preferences for a client and have the client supply those preferences each time that the client connected to the site.

Generally, a Cookie is introduced to the client by including information with a Set-Cookie command in a header as part of an HTTP response. An example of the Set-Cookie command included in an HTTP response header is listed below.

<HEADER>

Set-Cookie: NAME=VALUE; expires=DATE;

path=PATH; domain=DOMAIN NAME; secure

</HEADER>

When a client's browser program is requesting a URL from an HTTP server on the Internet, the browser will match the requested URL against all of the URLs stored in the client's Cookies. If the requested URL matches any of the stored URLs, a line containing the name/value pairs of all matching Cookies will be included in the HTTP request. An exemplary line in a Cookie for an HTTP request could be included as follows: Cookie: NAME1=OPAQUE STRING1; NAME2=OPAQUE STRING2.

A Cookie is typically used to save the state of a relationship between a client and a server. However, in some cases, the saved state of the relationship may create a load balancing problem. For example, each node server that is managed by a load balancing server array controller may not always share the same state relationship with the client that is saved in the Cookie. In this case, the controller must persistently send a repeated client HTTP request to the same node server because it is difficult to recreate the same state relationship in another server during the HTTP request/response session.

Although the saved state relationship in a Cookie can create a load balancing problem, the present invention uses the Cookie mechanism to offer a solution to this problem by enabling a network transmission device, e.g., a switch, Internet router, and/or a server array controller, to insert and/or examine Cookies in the data streams of HTTP connections for the purpose of reliably, conveniently and persistently directing connections to the same destination, e.g., a node server. Preferably, the network transmission device actively inserts data into or modifies the HTTP data stream from a server so that a Cookie can be saved by the client indicating the state relationship between the client and the server. In this way, the transmission device can use the Cookie returned in a subsequent client HTTP request to direct the current connection to the same server.

System Overview

FIG. 1A illustrates a system overview 100A of the data flow for an HTTP request/response for accessing resources associated with a domain name resolved into an Internet protocol (ip) address or an IP address that is directly provided by a client 10. In this example, the client 10 starts a session by connecting with an ISP 102 (located in Chicago, Ill.) over a communication medium. For example, the client may connect to the local ISP through a telephone modem, cable modem and/or satellite connection. The local ISP 102 usually provides a local domain name system (DNS) server 106 that communicates with other servers on the WAN for resolving a domain name request into an IP address when the client provides a domain name for accessing resources.

The client 10 sends an HTTP request 108A to the local ISP 102 for access to resources associated with an IP address that is either resolved or directly provided. A proxy server 104 will assign and add its first available virtual client IP address to the HTTP request 108A, so that the client 10 is identifiable during the current session. In the case where the HTTP request 108A identifies a domain name associated with the resource instead of an IP address, the local DNS server 106 employs a distributed database to resolve the domain name into the IP address for the requested resource.

The proxy server 104 sends the HTTP request 108A over the Internet 110 to a data center 112 located in Seattle, Wash., which is identified to be associated with the requested domain name (“domain.com”) or IP address. A router 114, (optional) firewall 116, server array controller 118 and an intranet of N node servers 120 are disposed at the data center 112. The server array controller 118 is used to manage and load balance network traffic on the intranet of node servers 120.

In one embodiment, the server array controller 118 intelligently distributes web site connections across arrays (pools) of node servers, transparent firewalls, transparent cache servers, routers as well as other router-like devices. The controller 118 may manage connections to multiple Internet or intranet sites, and it can support a wide variety of Internet protocols and services such as TCP/IP (transmission control protocol/Internet protocol) and HTTP. It is understood that the TCP/IP protocol actually represents a suite of communications protocols that are used to connect hosts on the Internet.

Each node server is defined by a specific combination of a node address and a node port number on the intranet behind the server array controller 118 which can monitor several aspects of the node servers. The controller 118 can load balance each connection to the intranet of node servers by selecting a unique IP address for a node server to provide optimal access to a requested resource.

The selected node server will provide access to the requested resource in an HTTP response that is sent by the server array controller 118 over the Internet 110 to the virtual client IP address at the Local ISP 102. The HTTP response includes a SET COOKIE command in the header of the response which includes information identifying the actual node server on the intranet behind the server array controller 118. The client accesses the requested resource in the HTTP response received from the Local ISP 102.

FIG. 1B illustrates a system overview 100B of substantially the same data stream flow as shown in FIG. 1A, except that the client 10 is providing an HTTP request 108B which includes a Cookie uniquely identifying a relationship between the previously selected node server and the client requesting access to the resource. While the Cookie is not expired, the server array controller 118 will automatically direct the HTTP request to the destination (node server) identified by the information in the Cookie. Thus, the server array controller 118 can use the information in the Cookie to reliably and efficiently load balance client demands for access to requested resources.

Logic Overview

In FIG. 2A, an overview 122 is shown of the general logic flow for an HTTP request that does not include a Cookie identifying the actual node server that will provide access to the requested resource. Moving from a start block, the logic steps to a block 124 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 at the IP address provided by the client. Advancing to a block 126, the client 10 transmits the HTTP request to the server array controller 118 without a Cookie identifying the node server that will provide access to the requested resource.

Flowing to a block 128, the server array controller 118 makes a load balancing determination and selects the optimal node server to provide access to the requested resource and routes the HTTP request to the selected node server. The server array controller 118 may employ any one of several different types of load balancing methods to analyze metric information and optimally balance client HTTP requests (load demand). These load balancing methods include round trip time, round robin, least connections, packet completion rate, quality of service, server array controller packet rate, topology, global availability, hops, static ratio and dynamic ratio.

Stepping to a block 130, the selected node server generates an HTTP response that enables the client 10 to access the requested resource. The selected node server transmits the generated HTTP response to the server array controller 118 which retransmits the response to the client 10 along with information included with a SET COOKIE command that enables the particular IP address of the selected node server to be identified. Depending upon the mode of the present invention that is selected, the SET COOKIE command may be inserted in the header of the HTTP response by the server array controller 118 and/or the selected node server. Next, the logic moves to an end block and terminates.

FIG. 2B shows an overview 132 for processing an HTTP request that includes a Cookie with information that can be used to identify the destination that was previously selected to provide access to the requested resources. Moving from a start block, the logic steps to a block 134 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 associated with the node server identified in the Cookie. Advancing to a block 136, the client 10 transmits the HTTP request to the server array controller 118 along with the Cookie and its information. The logic flows to a block 138 where the server array controller 118 uses the information included in the Cookie to route the HTTP request directly to the IP address of the node server that was previously selected to provide access to the requested resources.

Next, the logic moves to a block 140 where the selected node server generates an HTTP response for accessing the requested resources and provides this HTTP response to the server array controller 118. The controller 118 retransmits the HTTP response to the client 10 along with a SET COOKIE command that includes information that can be used to identify a relationship between the client and the destination (node server) that will provide access to the requested resources. The logic moves to an end block and terminates. The present invention thus enables the server array controller 118 to use the information in the Cookie to quickly, reliably and efficiently load balance client demands for access to requested resources.

Although not shown, another embodiment of the present invention enables the server array controller 118 to vary the expiration date of the time stamp included with HTTP requests and responses. When the load demand on the server array controller 118 increases, the controller may increase the period of time (expiration date) before the time stamp expires. Alternatively, when the load on the server array controller 118 decreases, the controller may decrease the period of time before the time stamp expires. By varying the expiration dates of the time stamps, the server array controller 118 may control the number of times that the controller performs load balancing determinations within a period of time. Also, when only a few destinations can provide access to the requested resource, the server array controller 118 may set the time stamp expiration date to one year or more.

The present invention provides at least four different modes of operation for inserting information in an HTTP response and examining Cookies in an HTTP request for uniquely identifying a relationship between the client and a selected destination such as a node server to provide access to the requested resources. These modes of operation include associative, passive, rewrite and insert.

Associative Mode

In FIG. 3A, an overview 142 of an “associative” mode for processing an HTTP response without a Cookie is illustrated. Moving from a start block, the logic steps to a block 144 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 at the IP address provided by the client. Advancing to a block 146, the client 10 transmits the HTTP request to the server array controller 118.

The logic flows to a block 148 where the server array controller 118 receives the HTTP request and makes a load balancing determination to select the optimal node server to provide access to the requested resource. After selecting the optimal node server, the server array controller 118 routes the HTTP request to the selected node server.

The logic steps to a block 150 where the selected node server generates an HTTP response that provides access to the requested resource. The selected node server transmits the HTTP response to the server array controller 118. The server array controller 118 inserts a SET COOKIE command with information uniquely identifying the client 10 into the HTTP response's header. The controller 118 retransmits the HTTP response and the Cookie information to the client 10.

Alternatively, the selected node server may include the SET COOKIE command in the HTTP response's header with blank information. In this case, the server array controller 118 rewrites this blank information with information that uniquely identifies the client 10 and retransmits the “rewritten” HTTP response to the client.

Next, the logic flows to a block 152 where the server array controller 118 maps the identified client and the ip address of the selected node server into a table that is stored in the memory of the controller. The logic moves to an end block and terminates. Additionally, it is understood that the SET COOKIE command causes the client to store the Cookie information that uniquely identifies the client, so that when the same HTTP request is repeated by the client, this stored Cookie information will be used to create a Cookie that is included with the repeated HTTP request.

FIG. 3B shows an overview 154 of an “associative” mode for processing an HTTP request that includes a Cookie with information that can be used to identify the client 10. The Cookie for an HTTP request may be provided from a previous HTTP request/response exchange as illustrated in FIG. 3A above. It is also envisioned that another facility may be employed to create a Cookie for the HTTP request that includes information identifying the client 10. In this example, the information included in the Cookie enables the server array controller 118 to uniquely identify the client 10 and a mapped relationship to a previously selected node server. Moving from a start block, the logic steps to a block 156 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 at an IP address provided by the client. Advancing to a block 158, the HTTP request along with the Cookie is transmitted from the client 10 to the server array controller 118.

The logic will move to a block 162 where the server array controller 118 will access the table held in its memory and identify the mapped relationship between the client and the previously selected node server for accessing the requested resources. Using the mapped relationship in the table, the controller 118 will provide the HTTP request to the previously selected node server. The logic flows to a block 168 where the node server generates an HTTP response which includes a SET COOKIE command with information that can be used to uniquely identify the client 10 requesting access to the resources at the IP address of the selected node server. The logic moves to a block 170 where the server array controller 118 updates another time stamp stored in the table which is associated with the mapping of the relationship between the client and the selected node server. Next, the logic moves to an end block and terminates.

Alternatively, in another embodiment, the node server could include a SET COOKIE command with blank information in the generated HTTP response. In this case, the server array controller 118 would rewrite the blank information to include other information that uniquely identifies the client 10 requesting access to the resources at the IP address of the selected node server.

In summary, the associative mode provides for inserting a Cookie into an HTTP response that uniquely identifies the client so that when a client's subsequent HTTP request is compared to a table, this subsequent HTTP request will be provided to a previously selected destination. The present invention thus enables the server array controller 118 to use the information in the Cookie to load balance client demands for access to requested resources. Additionally, it is understood that the associative mode puts most of the load for processing an HTTP request on the server array controller 118 relative to the load placed on a previously selected node server that is managed by the controller.

Passive Mode

In FIG. 4A, an overview 172 of a “passive” mode for processing an HTTP request/response is illustrated. Moving from a start block, the logic steps to a block 174 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 at the IP address provided by the client. Advancing to a block 176, the client 10 transmits the HTTP request to the server array controller 118.

The logic flows to a block 178 where the server array controller 118 receives the HTTP request and makes a load balancing determination to select the optimal node server to provide access to the requested resource. After selecting the optimal node server, the server array controller 118 provides the HTTP request to the selected node server. The logic steps to a block 180 where the selected node server generates an HTTP response that includes Cookie information identifying the selected node server, i.e., a SET COOKIE command is inserted into the header of the HTTP response. The selected node server provides the HTTP response along with the inserted Cookie information to the server array controller 118. The server array controller 118 provides the HTTP response with the Cookie information to the client 10. Next, the logic moves to an end block and terminates. Additionally, it is understood that the SET COOKIE command causes the client to store Cookie information that identifies the previously selected destination, e.g., a node server, so that when the same HTTP request is repeated by the client, this stored Cookie information will be used to create a Cookie that is included with the repeated HTTP request.

FIG. 4B shows an overview 182 of a “passive” mode for processing an HTTP request that includes a Cookie with information identifying a previously selected node server for providing access to requested resources. Moving from a start block, the logic steps to a block 184 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 at the IP address provided by the client. Advancing to a block 186, the HTTP request along with the Cookie is transmitted from the client 10 to the server array controller 118.

The logic moves to a block 190 where the server array controller 118 will use the information included in the Cookie to provide the HTTP request to the previously selected node server. The logic steps to a block 194 where the selected node server generates an HTTP response including Cookie information that identifies the selected node server. The selected node server provides the HTTP response with the Cookie information to the server array controller 118. The server array controller 118 retransmits the HTTP response with the Cookie information to the client 10. Next, the logic moves to an end block and terminates.

In summary, the passive mode provides for inserting Cookie information into an HTTP response that uniquely identifies a previously selected destination, such as a node server, so that when a client's subsequent HTTP request is examined, it can be efficiently provided to the previously selected destination. The present invention thus enables the server array controller 118 to use the information in the Cookie to load balance client demands for access to requested resources. Also, the passive mode puts most of the load for processing an HTTP request on a node server relative to the load placed on a server array controller 118 managing the node server.

Rewrite Mode

In FIG. 5A, an overview 200 of a “rewrite” mode for processing an HTTP response is illustrated. Moving from a start block, the logic steps to a block 202 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 at the IP address provided by the client. Advancing to a block 204, the client 10 transmits the HTTP request to the server array controller 118.

The logic flows to a block 206 where the server array controller 118 receives the HTTP request and makes a load balancing determination to select the optimal node server to provide access to the requested resource. After selecting the optimal node server, the server array controller 118 routes the HTTP request to the selected node server. The logic steps to a block 208 where the selected node server generates an HTTP response that includes blank Cookie information, i.e., a SET COOKIE command is inserted into the header of the HTTP response without information identifying the selected node server. The selected node server provides the HTTP response with the blank Cookie information to the server array controller 118. The logic moves to a block 210 where the controller 118 rewrites the blank Cookie information to identify the node server selected to provide access to the requested resources. The server array controller 118 transmits the HTTP response and the rewritten Cookie information to the client 10. Next, the logic moves to an end block and terminates.

FIG. 5B shows an overview 212 of a “rewrite” mode for processing an HTTP request that includes a Cookie with information for identifying a node server previously selected to provide access to the requested resources. Moving from a start block, the logic steps to a block 214 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 at an IP address provided by the client. Advancing to a block 216, the HTTP request along with the Cookie is transmitted from the client 10 to the server array controller 118.

The logic will move to a block 220 where the server array controller 118 will use the information included in the Cookie to identify the previously selected node server and route the HTTP request to this node server. The logic steps to a block 224 where the selected node server generates an HTTP response that includes blank Cookie information. The selected node server provides the HTTP response along with the inserted blank Cookie information to the server array controller 118. The logic steps to a block 226 where the server array controller 118 rewrites the blank Cookie information to include other information that identifies the selected node server. Next, the logic moves to an end block and terminates.

In the rewrite mode, the server array controller 118 manages the other “destination” information that is rewritten over the blank Cookie information. The rewrite mode roughly divides the load for processing an HTTP request/response between a server array controller 118 and a selected node server that is managed by the controller. The rewrite mode places a portion of this load on the selected node server to insert the blank Cookie in an HTTP response and another portion of this load on a server array controller 118 for rewriting the blank Cookie information to include other information that identifies the selected destination (node server). One advantage of the rewrite mode is that a plurality of node servers managed by the server array controller 118 may have the same content related to inserting blank Cookie information into an HTTP response. In this way, updates to the plurality of node servers are more easily provided because each node server can have the same content. Also, since the other information identifying the destination will occupy the same space as the blank Cookie information that was written over, the actual data packet containing the HTTP response does not have to change in size.

Insert Mode

In FIG. 6A, an overview 228 of an “insert” mode for processing an HTTP request/response is illustrated. Moving from a start block, the logic steps to a block 230 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 at the IP address provided by the client. Advancing to a block 232, the client 10 transmits the HTTP request to the server array controller 118 at the primary IP address.

The logic flows to a block 234 where the server array controller 118 receives the HTTP request and makes a load balancing determination to select the optimal node server to provide access to the requested resource. The server array controller 118 provides the HTTP request to the selected node server. The logic steps to a block 236 where the selected node server generates an HTTP response and provides the generated HTTP response to the server array controller 118. The logic moves to a block 238 where the server array controller 118 rewrites the data packet(s) containing the HTTP response so that Cookie information identifying the node server selected to provide access to the requested resources can be inserted into the data packet. The logic flows to a block 240 where the server array controller 118 provides to the client 10 the rewritten data packet that includes the HTTP response and the inserted Cookie information. Next, the logic moves to an end block and terminates.

FIG. 6B shows an overview 242 of an “insert” mode for processing an HTTP request that includes a Cookie with information identifying a node server previously selected to provide access to the requested resources. Moving from a start block, the logic steps to a block 244 where a TCP/IP handshake is performed between the client 10 and the server array controller 118 at an IP address provided by the client. Advancing to a block 246, the HTTP request along with the Cookie is transmitted from the client 10 to the server array controller 118.

The logic will move to a block 250 where the server array controller 118 will use the information included in the Cookie to identify the previously selected node server. The server array controller 118 will rewrite the data packet(s) containing the HTTP response. The server array controller 118 will provide the rewritten data packet(s) containing the HTTP response to the client 10. The logic steps to a block 254 where the selected node server generates an HTTP response and provides the HTTP response to the server array controller 118. The logic moves to a block 256 where the server array controller 118 rewrites the data packet(s) containing the HTTP response to insert Cookie information into the response's header for identifying the node server selected to provide access to the requested resources. The logic flows to a block 258 where the server array controller 118 transmits to the client 10 a rewritten data packet that includes the HTTP response and the newly inserted Cookie information. Next, the logic moves to an end block and terminates.

The insert mode enables a server array controller 118 to load balance client demands for access to requested resources by inserting and removing Cookie information in the data packets for HTTP requests and responses prior to processing by the destination (selected node server). In the insert mode, all of the load for inserting and examining Cookie information and rewriting data packets is placed on the server array controller 118 and none of this load is put on the node servers managed by the controller.

Exemplary Cookie Code Fragments

In FIGS. 7A-7E, exemplary embodiments of HTML code fragments are shown that illustrate Cookie information included with an HTTP request/response. FIG. 7A shows an exemplary code fragment for an HTTP request 260 that includes Cookie information identifying a server that previously provided access to the requested resources. FIG. 7B illustrates an exemplary code fragment for an HTTP request 262 that contains blank Cookie information that does not identify the client 10 or a previously selected destination such as a node server. FIG. 7C shows an exemplary code fragment for an HTTP request 264 that includes Cookie information identifying the client 10 that previously requested access to the requested resources. FIG. 7D illustrates an exemplary code fragment for an HTTP response 266 that includes Cookie information identifying a server that previously provided access to the requested resources. FIG. 7E shows an exemplary code fragment for an HTTP response 267 that includes Cookie information identifying the client requesting access to the resources provided by the server.

Proxy Server Buffering

FIG. 8 illustrates an overview of how certain data packets between the client and a selected node server 274 are buffered and replayed by a server array controller's proxy server 270. A rectangular graphical representation of the proxy server 270 is vertically disposed along the center portion of this figure. Also, positioned along the center line of the developed length of the proxy server 270 is a time line 271 that starts with an initial value of “zero” near the top of the graphical representation of the proxy server. On the left side of the proxy server 270 is a graphical representation of a client 272 transmitting and receiving data packets along the developed length (and time line 271) of the proxy server. Similarly on the right side of the proxy server 270 is a graphical representation of a node server 274 transmitting and receiving data packets along the developed length of the proxy server.

Starting at the top left side of the figure, the client 10 is transmitting and receiving three separate groups of data packets with the proxy server 270. First, a TCP SYN 276A data packet is transmitted from the client 272 to the proxy server 270, which is followed by an exchange of TCP SYN/ACK.ACK 278A data packets. Next, an HTTP REQUEST 280A data packet is transmitted to the proxy server by the client.

All three groups of data packets are buffered and stored by the proxy server 270 until the HTTP REQUEST 280A is received by the proxy server. Then, the server array controller will examine the data packet(s) associated with the HTTP REQUEST 280A to determine if it includes Cookie information that identifies the client and/or a destination that previously provided access to the requested resources.

Once the Cookie determination is made, the proxy server 270 will sequentially replay the transmitting and receiving of the three groups of data packets with the selected node server 274. On the right side of the graphical representation of the proxy server 270, these three groups of data packets are replayed between the proxy server 270 and the node server 274. First, a TCP SYN 276B data packet is transmitted from the proxy server 270 to the node server 274, followed by an exchange of TCP SYN/ACK.ACK 278B data packets and next an HTTP REQUEST 280B data packet is transmitted to the node server 274 by the proxy server 270.

Moving further down the length of the graphical representation of the proxy server 270, a data packet(s) for an HTTP RESPONSE 282A is provided to the proxy server 270 by the selected node server 274. The proxy server 270 immediately replays this data packet to the client 272 in HTTP RESPONSE 282B. Next, the client 272 exchanges TCP FIN.ACK.FIN.ACK 2848 data packets with the proxy server 270. The proxy server 270 immediately replays these data packets to the node server 274 as TCP FIN.ACK.FIN.ACK 284A data packets.

It is important to note that the present invention only employs the proxy server 270 to buffer and store data packets until the HTTP request is received. Once the HTTP request is received, the proxy server will replay all of the buffered data packets for the selected node server 274 and switch to a forwarding mode for subsequent data packets, i.e., the proxy server will immediately replay all subsequent data packets transmitted by the client 272 to the selected node server.

System Configuration

FIG. 9 illustrates a system for the client 10 comprising components of a computer suitable for executing an application program embodying the present invention. In FIG. 5, a processor 12 is coupled bi-directionally to a memory 14 that encompasses read only memory (ROM) and random access memory (RAM). ROM is typically used for storing processor specific machine code necessary to bootup the computer comprising client 10, to enable input and output functions, and to carry out other basic aspects of its operation. Prior to running any application program, the machine language code comprising the program is loaded into RAM within memory 14 and then executed by processor 12. Processor 12 is coupled to a display 16 on which the visualization of the HTML response discussed above is presented to a user. Often, programs and data are retained in a nonvolatile memory media that may be accessed by a compact disk-read only memory (CD-ROM) drive, compact disk-read/write memory (CD-R/W) drive, optical drive, digital versatile disc (DVD) drive, hard drive, tape drive and floppy disk drive, all generally indicated by reference numeral 18 in FIG. 9. A network interface 22 couples the processor 12 to a wide area network such as the Internet.

As noted above, the present invention can be distributed for use on the computer system for the client 10 as machine instructions stored on a memory media such as a floppy disk 24 that is read by the floppy disk drive. The program would then typically be stored on the hard drive so that when the user elects to execute the application program to carry out the present invention, the machine instructions can readily be loaded into memory 14. Control of the computer and selection of options and input of data are implemented using input devices 20, which typically comprise a keyboard and a pointing device such as a mouse (neither separately shown). Further details of system for the client 10 and of the computer comprising it are not illustrated, since they are generally well known to those of ordinary skill in the art. Additionally, although not shown, computer systems for the node server 120 and the server array controller 118 could be configured in substantially the same way as the computer system for the client 10 illustrated here, albeit different in other ways.

Cookie Types

It is further envisioned that other types of Cookies may be used to identify a path that would be used to exchange data packets between the client and a destination such as a host machine, firewall, router or a node server managed by a server array controller. A “path” type of Cookie could be used to indicate the actual route and interim destinations that the data packets must use to travel between the client (source side) and the destination (supply side). For example, the path Cookie could indicate the individual routers that must be used to send data packets containing the HTTP requests and/or HTTP responses between the client and the destination.

A “hops” type of Cookie could be used to indicate an intermediate destination in the route the data packets must use to travel between the client and the destination. For example, a hops cookie could indicate a particular router that must always be used to send data packets containing the HTTP requests and/or HTTP responses between the client and the destination.

A “priority” type of Cookie may be used to indicate a priority for processing a data packet containing an HTTP request ahead of other data packets. Also, each priority Cookie could include a range of values indicating a level of priority. In this way, a data packet containing an HTTP request and a priority Cookie with the high priority value would be processed (sent) ahead of other data packets that contained HTTP requests and lower priority Cookies.

A “load balancing” Cookie could be used to indicate the load balancing method that the server array controller should perform to select the optimal node server to provide access to the resources when an HTTP request does not include a current Cookie with information identifying a destination. It is also envisioned that multiple types of Cookies and information could be included in HTTP requests and HTTP responses.

Additionally, it is envisioned that a unique identification of a client or a destination may be represented as encoded information in the Cookie. The result of an equation or a hash value may be used to encode the destination uniquely identified in the Cookie. A hash value (or simply hash) is a number generated from a string of text. The hash is substantially smaller than the text itself, and it is generated by a formula in such a way that it is extremely unlikely that some other text will produce the same hash value. Generally, the sender generates a hash of a message, encrypts the hash, and sends it with the message itself. The recipient then decrypts both the message and the hash, produces another hash from the received message, and compares the two hashes. If they are the same, there is a very high probability that the message was transmitted intact. A hash provides a quickly determinable value in the Cookie for identifying a relationship between the client and the destination.

An exemplary equation for directly determining the IP address of a selected node server (N) is as follows: ip4=N%256; ip3=((N−ip4)/256)%256; ip2=((N−ip4−ip3*256)/(256*256)%256; ip1=((N−ip4−ip3*256−ip2*256*256)/(256*256*256))%256;

Where the IP address for N=ip1*256*256*256+ip2*256*256+ip3*256+ip4.

While the preferred embodiment of the invention has been illustrated and described, it will be appreciated that various changes can be made therein without departing from the spirit and scope of the invention. 

1. A method for providing persistent communications between at least one client and a plurality of destination servers that serve a network domain resource, comprising: enabling at least one of the plurality of destination servers to receive application level protocol requests from a client, the application level protocol requests being forwarded to one of the plurality of destination servers with a network transmission device; receiving, at the one of the plurality of destination servers, a first application level protocol request from the client, the first application level protocol request comprising a request for access to the network domain resource provided by the destination server, wherein the client's first application level protocol request is unspecific regarding the one destination server that provides access to the network domain resource; sending, to the client, an application level protocol command to store data, the application level protocol command including information inserted by the network transmission device that indicates the one of the plurality of destination servers that received the first application level protocol request from the client; and receiving, at the one of the plurality of destination servers, a second application level protocol request from the client; wherein the information included with the application level protocol command is also included with the second application level protocol request and employable by the network transmission device to indicate the one of the plurality of destination servers.
 2. The method of claim 1, further comprising sending, to the client, a response to each application level protocol request.
 3. The method of claim 1, further comprising sending, to the client, a timestamp indicative of a time period during which the information is to be used.
 4. The method of claim 1, wherein the information comprises a specification of a node address and a node port number.
 5. The method of claim 1, wherein the application level protocol is the Hypertext Transfer Protocol (HTTP).
 6. The method of claim 1, wherein the application level protocol command to store data is a command to create a cookie.
 7. A method for providing persistent communications between at least one client and a plurality of destination servers, comprising: enabling at least one of the plurality of destination servers to receive from a client application level protocol requests, the application level protocol requests being forwarded to at least one of the plurality of destination servers with a network transmission device; receiving, at one of the plurality of destination servers, a first application level protocol request from the client, wherein the first application level protocol request comprises a request for access to a network domain resource served by the one of the plurality of destination servers, and wherein the client's first application level protocol request is unspecific regarding the one destination server that provides access to the network domain resource; and sending, to the client, with a first response to the first application level protocol request, an application level protocol command to store data, the application level protocol command including a designated space for insertion of information indicating the one of the plurality of destination servers by the network transmission device.
 8. The method of claim 7, further comprising: receiving, at the one of the plurality of destination servers, a second application level protocol request from the client; and sending, to the client, a second response to the second application level protocol request.
 9. The method of claim 7, wherein the space is sized to allow the network transmission device to rewrite the one of the plurality of destination servers particular identification data in the space and to maintain the size of the first response.
 10. The method of claim 7, wherein the application level protocol command is recognizable by the network transmission device as including the space, and wherein the space is overwritable.
 11. The method of claim 7, wherein the one of the plurality of destination servers maintains content relating to including the space in the application level protocol command to store data, the content being substantially the same as content maintained on at least one other of the plurality of destination servers.
 12. The method of claim 7, wherein the application level protocol command is provided without information identifying any one of the plurality of destination servers.
 13. The method of claim 7, wherein the application level protocol is the Hypertext Transfer Protocol (HTTP).
 14. The method of claim 7, wherein the application level protocol command to store data is a command to create a cookie.
 15. A device for enabling communication between at least one of a plurality of servers and at least one client in a network, comprising: a network interface for communicating application level messages between at least one client and at least one of a plurality of servers; and a processor for enabling actions, including: enabling at least one of the plurality of servers to send an application level response to an application level request from a client for access to a network domain resource served by at least one of the plurality of servers, wherein the client's application level request is unspecific regarding the at least one server that provides access to the network domain resource; employing the device to add information within an application level protocol command to the application level response and send the application level response along with the added information to the client, wherein the added information indicates the one of the plurality of servers that responded to the application level request from the client for access to the network domain resource; and if the added information is included with another application level request from the client, employing the added information to process the other application level request for access to the network domain resource.
 16. The device of claim 15, further comprising employing the device to determine the one of the plurality of servers for responding to the other application level request based at least in part on the added information.
 17. The device of claim 15, further comprising employing the device to add information by overwriting at least a portion of the application level response.
 18. The device of claim 15, further comprising employing the device to add information by inserting the information in the application level response.
 19. The device of claim 15, wherein the device is operational as at least one of a router, a load balancer, a switch, a server array controller, a server, and an intermediate network device.
 20. The device of claim 15, wherein the application level messages are operational with at least the Hypertext Transport Protocol (HTTP).
 21. The device of claim 15, further comprising enabling at least one of the plurality of servers to set information included in the application level response that indicates at least one of the server and the client.
 22. The device of claim 15, further comprising employing the device to set the added information to indicate at least one of the plurality of servers and the client.
 23. A device for enabling communication between at least one of a plurality of servers and at least one client over a network, comprising: a network interface for communicating over the network; and a processor for enabling actions, including: providing a request from a client to at least one of the plurality of servers to access a network domain resource, wherein persistence information is included by the device in a response to the request from one of the plurality of servers that serves the network domain resource, and wherein the persistence information indicates the server that responded to the request from the client, and wherein the client's request is unspecific regarding the at least one server that serves access to the network domain resource; examining the request for persistence information included in the response; forwarding the response with the persistence information to the client; and if a subsequent request from the client includes the persistence information that indicates the server that previously responded to the request from the client, directing the request to the indicated one of the plurality of servers.
 24. The device of claim 23, further comprising including information along with the persistence information that identifies at least one of the one server and the client.
 25. The device of claim 23, further comprising storing at least the persistence information in response to receiving the response from the one of the plurality of servers.
 26. A computer readable storage medium, comprising executable instructions for enabling communication between at least one of a plurality of servers and at least one client in a network, the actions comprising: enabling at least one of the plurality of destination servers to receive application level protocol requests from a client, the application level protocol requests being forwarded to one of the plurality of destination servers with a network transmission device; receiving, at the one of the plurality of destination servers, a first application level protocol request from the client, the first application level protocol request comprising a request for access to the network domain resource provided by the destination server, and wherein the client's first application level protocol request is unspecific regarding the at least one destination server that provides access to the network domain resource; sending, to the client, an application level protocol command to store data, the application level protocol command including information inserted by the network transmission device that indicates the one of the plurality of destination servers that received the first application level protocol request from the client; and receiving, at the one of the plurality of destination servers, a second application level protocol request from the client; wherein the information included with the application level protocol command is also included with the second application level protocol request and employable by the network transmission device to indicate the one of the plurality of destination servers.
 27. The computer readable storage medium of claim 26, wherein the actions further comprise sending, to the client, a response to each application level protocol request.
 28. The computer readable storage medium of claim 26, wherein the actions further comprise sending, to the client, a timestamp indicative of a time period during which the information is to be used.
 29. The computer readable storage medium of claim 26, wherein the information comprises a specification of a node address and a node port number.
 30. The computer readable storage medium of claim 26, wherein the application level protocol is the Hypertext Transfer Protocol (HTTP).
 31. The computer readable storage medium of claim 26, wherein the application level protocol command to store data is a command to create a cookie.
 32. A system for providing persistent communication over a network for a network domain resource, comprising: a plurality of destination servers; and a network device, including: at least one memory for storing instructions; and at least one processor for executing the instructions, the executed instructions enabling actions, including: enabling at least one of the plurality of destination servers to receive application level protocol requests from a client, the application level protocol requests being forwarded to one of the plurality of destination servers with a network transmission device; receiving, at the one of the plurality of destination servers, a first application level protocol request from the client, the first application level protocol request comprising a request for access to the network domain resource provided by the destination server, and wherein the client's first application level protocol rawest is unspecific regarding the at least one destination server that provides access to the network domain resource; sending, to the client, an application level protocol command to store data, the application level protocol command including information inserted by the network transmission device that indicates the one of the plurality of destination servers that received the first application level protocol request from the client; and receiving, at the one of the plurality of destination servers, a second application level protocol request from the client; wherein the information included with the application level protocol command is also included with the second application level protocol request and employable by the network transmission device to indicate the one of the plurality of destination servers.
 33. The system of claim 32, wherein the actions further comprise sending, to the client, a response to each application level protocol request.
 34. The system of claim 32, wherein the actions further comprise sending, to the client, a timestamp indicative of a time period during which the information is to be used.
 35. The system of claim 32, wherein the information comprises a specification of a node address and a node port number.
 36. The system of claim 32, wherein the application level protocol is the Hypertext Transfer Protocol (HTTP).
 37. The system of claim 32, wherein the application level protocol command to store data is a command to create a cookie.
 38. A computer readable storage medium, comprising executable instructions for performing actions for providing persistent communications between at least one client and a plurality of destination servers, the actions comprising: enabling at least one of the plurality of destination servers to receive from a client application level protocol requests, the application level protocol requests being forwarded to at least one of the plurality of destination servers with a network transmission device; receiving, at one of the plurality of destination servers, a first application level protocol request from the client, wherein the first application level protocol request comprises a request for access to a network domain resource served by the one of the plurality of destination servers, and wherein the client's first application level protocol request is unspecific regarding the at least one destination server that provides access to the network domain resource; and sending, to the client, with a first response to the first application level protocol request, an application level protocol command to store data, the application level protocol command including a designated space for insertion of information indicating the one of the plurality of destination servers by the network transmission device.
 39. The computer readable storage medium of claim 38, wherein the actions further comprise: receiving, at the one of the plurality of destination servers, a second application level protocol request from the client; and sending, to the client, a second response to the second application level protocol request.
 40. The computer readable storage medium of claim 38, wherein the space is sized to allow the network transmission device to rewrite the one of the plurality of destination servers particular identification data in the space and to maintain the size of the first response.
 41. The computer readable storage medium of claim 38, wherein the application level protocol command is recognizable by the network transmission device as including the space, and wherein the space is overwritable.
 42. The computer readable storage medium of claim 38, wherein the one of the plurality of destination servers maintains content relating to including the space in the application level protocol command to store data, the content being substantially the same as content maintained on at least one other of the plurality of destination servers.
 43. The computer readable storage medium of claim 38, wherein the application level protocol command is provided without information identifying any one of the plurality of destination servers.
 44. The computer readable storage medium of claim 38, wherein the application level protocol is the Hypertext Transfer Protocol (HTTP).
 45. The computer readable storage medium of claim 38, wherein the application level protocol command to store data is a command to create a cookie.
 46. A system for providing persistent communication over a network for a network domain resource, comprising: a plurality of destination servers; and a network device, including: at least one memory for storing instructions; and at least one processor for executing the instructions, the executed instructions enabling actions, including: enabling at least one of the plurality of destination servers to receive from a client application level protocol requests, the application level protocol requests being forwarded to at least one of the plurality of destination servers with the network transmission device; receiving, at one of the plurality of destination servers, a first application level protocol request from the client, wherein the first application level protocol request comprises a request for access to a network domain resource served by the one of the plurality of destination servers, and wherein the client's first application level protocol request is unspecific regarding the at least one destination server that provides access to the network domain resource; and sending, to the client, with a first response to the first application level protocol request, an application level protocol command to store data, the application level protocol command including a designated space for insertion of information indicating the one of the plurality of destination servers by the network transmission device.
 47. The system of claim 46, wherein the actions further comprise: receiving, at the one of the plurality of destination servers, a second application level protocol request from the client and sending, to the client, a second response to the second application level protocol request.
 48. The system of claim 46, wherein the space is sized to allow the network transmission device to rewrite the one of the plurality of destination servers particular identification data in the space and to maintain the size of the first response.
 49. The system of claim 46, wherein the application level protocol command is recognizable by the network transmission device as including the space, and wherein the space is overwritable.
 50. The system of claim 46, wherein the one of the plurality of destination servers maintains content relating to including the space in the application level protocol command to store data, the content being substantially the same as content maintained on at least one other of the plurality of destination servers.
 51. The system of claim 46, wherein the application level protocol command is provided without information identifying any one of the plurality of destination servers.
 52. The system of claim 46, wherein the application level protocol is the Hypertext Transfer Protocol (HTTP).
 53. The system of claim 46, wherein the application level protocol command to store data is a command to create a cookie. 